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BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] This invention relates generally to microphone systems, and, 
more specifically, to highly directional microphones providing a flat frequency 
response. 

Background Information 

[0002] In the reception and recording of sound, there are many 
applications when is useful to have directional microphones. The standard technique is 
to rely on the directional response of microphone that is itself directional, such as a 
pressure gradient or "shotgun" type microphone. These microphones are limited both 
in the directionality of response and in the flatness frequency response. Various aspects 
of directional microphones of "classical" design are discussed in a number of articles, 
such as: Harry F. Olson "Directional Microphones," Journal of the Audio Engineering 
Society, October 1967, and B.R. Beavers, R. Brown "Third-Order Gradient 
Microphone for Speech Reception" Journal of the Audio Engineering Society, 
December 1970. These two articles are included in "Microphones: An Anthology of 
Articles on Microphones from the Pages of the Journal of the Audio Engineering 
Society" Publications office of the Audio Engineering Society (1979), which is hereby 
incorporated by this references. 

[0003] In a series of articles dating from the early 1970's, Michel 
Gerzon suggested using cancellation between two adjacent microphones to achieve 
high directionality in a limited frequency range. This is described in a series of articles: 
"Ultra-Directional Microphones: Applications of Blumlein Difference Technique: Part 
1" Studio Sound, Volume 12, pp434-437, October 1970; "Ultra-Directional 
Microphones: Applications of Blumlein Difference Technique: Part 2" Studio Sound, 
Volume 12, 501-504, November 1970; and "Ultra-Directional Microphones: 
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Applications of Blumlein Difference Technique: Part 3" Studio Sound, Volume 12, 
539-543, December 1970, which are all hereby incorporated by reference. This is also 
similar to the techniques used in certain aspects of phased-array radar. By combining 
the output of the microphones, the interference between the outputs adds constructively 
in a direction perpendicular to the axis connecting the microphones, but cancels to a 
varying degree in other directions. 

[0004] Although this results in a high degree of directionality to the 
response, it is highly dependent upon the relation between the microphones' spacing 
and the frequency of the sound. Although radar and other applications only require 
sensitivity in a fairly narrow frequency range, audio applications may require that the 
frequency response be flat over a sizable portion of the audio range. 

SUMMARY OF THE INVENTION 

[0005] The present invention provides a highly directional audio 
response that is flat over five octaves or more by the use of multiple colinear arrays 
followed by signal processing. In a preferred embodiment, each of the colinear arrays 
has a common center, but a different spacing so that it can be used for a different 
frequency range. The response of the microphones for each spacing are combined and 
filtered. The frequency response of each filter is selected so that when the filtered 
responses are added, this combined response is flat over the selected frequency range. 
The size and limits of the selected frequency range are not limited and can be extended 
by increasing the number of arrays and filters used. 

[0006] To improve the response, the output of the microphones for a 
given array spacing can also be filtered with windowing functions. This helps reduce 
the array response for directions not directly in front of the array. To receive the 
response from other directions a "steering" delay may also be introduced in the 
microphone signals before they are combined. The microphone signals may either be 
supplied directly from the microphones or have been previously recorded from the 
microphones' outputs. 

[0007] The invention also extends to two and three dimensional arrays. 
By introducing arrays with several regular spacings in two or three dimensions, the 
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response can centered in any direction. In one embodiment, a two-dimensional 
microphone array "fabric" is composed of a grid of combined transducer, preprocessor, 
and network interface units. 

[0008] Additional aspects, features and advantages of the present 
invention are included in the following description of specific representative 
embodiments, which description should be taken in conjunction with the accompanying 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0009] Figure 1 shows a linear array of microphones with a spacing of 

d. 

[0010] Figure 2 shows the amplitude of the response of the sum of all 
the feeds from the microphone array with changing angle of incidence for different 
wavelengths. 

[0011] Figure 3 shows the effect of "steering" the array by adding a 
simple delay to each microphone. 

[0012] Figure 4 shows the effect of using a window function to change 
the tradeoff between center lobe width and side lobe suppression. 

[0013] Figure 5 shows three overlapping arrays sharing center 

microphones. 

[0014] Figure 6 is a plot of Beta parameter to Kaiser-Bessel window for 
values of wavelength in multiples of the microphone spacing. 

[0015] Figure 7 shows lobe widths after normalization by adjusting the 
Beta parameter of the Kaiser-Bessel window. 

[0016] Figure 8 are typical windowing gain curves representing 
particular points of the Kaiser-Bessel window as the Beta parameter is swept as shown 
in Figure 6. 

[0017] Figure 9 is a block diagram of processing for overlapped 
microphone arrays. 

[0018] Figure 10 shows the response of one kind of prototype overlap 
filter covering the band from 2000 Hz to 4000 Hz. 
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[0019] Figure 11 is a diagram of a pressure-gradient condenser 

microphone. 

[0020] Figure 12 shows a regular 2-dimensional array with equal 
resolution in horizontal and vertical directions. 

[0021] Figure 13 is a 2-dimensional microphone array showing unequal 
resolution in vertical and horizontal directions. 

[0022] Figure 14 shows two 2-dimensional arrays placed at right angles. 

[0023] Figure 15 shows an embodiment of the processing for a 
microphone in the array. 

[0024] Figure 16 shows an embodiment including the preprocessing and 
A/D conversion in the same physical location as the microphone capsule itself. 

[0025] Figure 17 shows an embodiment as a microphone array "fabric". 

DESCRIPTION OF REPRESENTATIVE EMBODIMENTS 
[0026] The discussion starts with an array of microphones placed at 
equal distances along a line, as shown in Figure 1. Let d be their separation. Let a 
plane wave impinge on the array at an angle of 6 from the perpendicular to the array. 
Assume that the plane wave is a sinusoid with a wavelength of X. If n is the number of 
microphones, then the response to the plane wave in microphone k can be written as 
follows: 

c kd 

(1 ) sin(27r — (f + — sin 0)) 

X c 

For convenience, let the number of microphones be odd, and call the center microphone 
number zero. The discussion readily extends to the even number case, although the odd 
case is presented more fully here as it allows a greater degree of microphone sharing 
between different spacing in arrangements such as Figure 5. The variable t represents 
time in seconds. If these signals are summed over all the microphones and simplify, 
the following is obtained: 

c f (n_I)/ 2 kd ] 

(2) sin(2^r — 0|l + 2 ^ cos(2tt — sin 0)j 
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The second term of the above represents the amplitude of the resulting sum. This is 
plotted for various values of wavelength in Figure 2, that shows the amplitude of the 
response of the sum of all the feeds from the microphone array with changing angle of 
incidence. Each curve represents a different wavelength from l.5d (narrowest) 201 to 
6d (widest) 210. Note that the maximum response is developed in a direction 
perpendicular to the microphone array. The varying width of the response maximum 
show that different wavelengths will have different pickup patterns. 

[0027] The entire array can be "steered" by applying a simple delay to 
each microphone as follows: 

-kd 

(3) At k = sin^ , 

c 

where cp is the angle where the greatest sensitivity is desired. 

[0028] This has the effect of moving the maximum of the response of 
the array, but it also changes the width of the center lobe. Figure 3 shows the effect of 
"steering" the array from -45° 305 to 45° 303, with curve 301 showing ^=0°. The 
wavelength of the test signal was set to a constant 2.5 d. Note that the main response 
widens a bit as the array is steered away from the center. This is because the 
"effective" microphone spacing is reduced by the cosine of the angle. 

[0029] Since the amplitude term in equation (1) resembles a Fourier 
series, the use of window functions can change the tradeoff between center lobe width 
and side lobe suppression. Figure 4 shows the effect of changing the strength of the 
window. The window was the Kaiser-Bessel window with the p parameter varying 
between 0.5 in curve 401 and 5.5 in curve 403, where lobe width increases with 
increasing window strength. More information on window functions is given, for 
example, in Leland B. Jackson "Digital Filters and Signal Processing," Kluwer 
Academic Publishers, Hingham, MA USA, 1986 - see Section 9.1, ppl28-134, which is 
hereby incorporated by this reference. 

[0030] So far, this is discussion is based on that from phased-array radar 
technology, described, for example, chapter 7 of "Radar Handbook" by Merrill I. 
Skolnik, McGraw-Hill, Inc., 1990, which is hereby included by reference. To make 
this more useful for audio, the system should preferable produce uniform lobed width 
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over the relevant frequencies and achieve a flat frequency response over five or more 
octaves, preferably a 10-octave range of roughly 20 Hz to 20 kHz. The reason for 
uniform lobe width is to reduce the coloration of the sound in the principal direction of 
the array. Since the array depends on cancellation and reinforcement of the wave 
fronts, it is necessarily a highly frequency-dependent process and is preferably followed 
with sufficient processing to minimize the frequency dependencies. 

[0031] The basic array exhibits reasonable response over about 2 
octaves covering wavelengths from about l.5d and 6d. Wavelengths longer than this 
produces very wide principal lobes, and wavelengths shorter than this produce multiple 
principal lobes. The center octave of this (in a geometric-mean sense) can be taken as 
the main region of response, which is from about 2. 12 d to about 4A4d. The remainder 
of the response range will be used to overlap with other arrays that cover other octaves. 

[0032] A wide response can be obtained by having multiple arrays on 
the same line with the same microphone in the center. Figure 5 shows a simplified 
diagram with three colinear arrays with spacings at d, 2d and 4d and five microphones 
for each spacing. For example, microphone 503 has both the spacings d and 2c? and 
microphone 502 has both the spacings 2d and Ad. To cover the full audio range with 
equal spatial resolution, an exemplary embodiment would have a total of ten array 
spacings. Each array will contribute one octave of frequency response to the overall 
result. The upper and lower half-octave of each array will overlap with the adjacent 
arrays. 

[0033] The next aspect to be addressed is control of the width of the 
principal lobe. As noted above, a window function can be used to adjust the width of 
the center lobe. Since a different lobe width is preferably used at each different 
frequency, the output of each array is filtered with individual filters that are designed to 
realize a certain window function at each frequency. The filters should also sum 
properly with the responses of adjacent arrays to produce flat frequency response and 
uniform lobe width when summed over all the arrays. 

[0034] Since window functions make the lobe wider, it is preferable to 
take the widest lobe width and match all the other widths to this. The widest lobe in the 
range of interest occurs at 6d. A simple optimization can derive values of the beta 
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parameter of the Kaiser-Bessel window that give us the desired window width. Figure 
6 shows the result of such an optimization. Figure 6 is a plot of the beta parameter to 
Kaiser-Bessel window for values of wavelength expressed in multiples of the 
microphone spacing. These values of beta equalize the main lobe widths for the given 
wavelength. This curve appears to be largely independent of the number of 
microphones in the array. As the wavelength moves from 6d down to l.5d, the beta 
parameter can be increased steadily to widen the principal lobe. 

[0035] Figure 7 shows the result of applying different window functions 
to the array at different wavelengths and shows lobe widths after normalization by 
adjusting the Beta parameter of the Kaiser-Bessel window. The wavelengths span the 
range from l.5d to 6d. Note that the sideband gain increases at the ends of the 
frequency range due to the windowing. This is using 15 microphones in a single array. 
Note that at the shortest wavelength, the sideband rejection starts to rise again, probably 
due to the effective "shortening" of the array. 

[0036] Figure 8 is a typical windowing gain curves for four 
microphones in a 9-microphone array at various values of wavelength (in multiples of 
d). These represent particular points of the Kaiser-Bessel window as the Beta 
parameter is swept as shown in Figure 6. The upper curve represents the center 
microphone, and the center point of the window function. 

[0037] There is nothing particularly special about the Kaiser-Bessel 
window. It is used here simply because it comes with a single parameter that controls 
the width of the window in a smooth, continuous, and monotonic fashion. One could 
equally derive an "optimum" window by a least-squares technique. This would allow 
"fine tuning" the response at any given frequency by adjusting the tradeoff between 
matching the center lobe to the prototype response (which is the response at the longest 
wavelength, 6d) to the off-axis response. Note in Figure 6 that the off-axis peaks get 
greater as the wavelength gets longer. This is to be expected, since smaller values of 
Beta allow the sidelobes to increase in amplitude. Defining a window function, w k , 
then define a weighting function at each angle as p i . An objective function can then 
be described as follows: 
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(4) F = % Pi \D t -\-2 £ w ft cos(2* — sin^.)| 

where D t represents the "desired" response. In the present example case, a desired 
response can be produced by windowing the response at the maximum wavelength of 
6d. Using this as the prototype response, this can be matched as closely as desired by 
choosing the weighting function, p i , and finding the window function coefficients, w k , 
that minimize F in equation (4). Since the response of the array is linear with respect to 
any given window coefficient, equation (4) represents a linear least-squares problem. 
The normal equations can be formed and solved by any number of methods, such as 
singular-value decomposition (described, for example, in sections 2.5 and 8.6 of Gene 
H. Golub, Charles F. Van Loan "Matrix Computations: Third Edition" Johns Hopkins 
University Press, Baltimore MD USA, 1996, which is hereby incorporated by 
reference). One might choose, for instance, p i = 1 to match the desired response as 
well as possible over the entire function. One might choose p i =10 over the main lobe 
and p i = 1 elsewhere to force the response to match the desired response as well as 
possible at the main lobe and less well outside the main lobe. 

[0038] Since the Kaiser-Bessel window is relatively simple, this 
embodiment is used in the remainder of this discussion with the understanding that any 
suitable window that allows matching of the principal lobes can be used. 

[0039] To implement a window function that varies with frequency, a 
filter is implemented for each microphone that has the desired gain at each wavelength. 
This gain is determined by the value of the Kaiser-Bessel window for that microphone 
at the value of beta indicated by the curve of Figure 6. The resulting window function 
is, in fact, a family of window functions, since the window function will be different for 
each different frequency. This can be represented this as w k (A) for the weighting of 
microphone k at a wavelength of X. Figure 7 shows a plot of four different microphone 
coefficients as functions of wavelength. These represent the filters that must be 
realized to produce equal main lobe widths over the frequency range of interest. There 
are many ways to calculate the filter coefficients, such as the methods described in 
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Leland B. Jackson "Digital Filters and Signal Processing," that was incorporated by 
reference above, or either of J.H. McClellan, T.W. Parks, L.R. Rabiner "A Computer 
Program for Designing Optimum FIR Linear Phase Digital Filters" IEEE Transactions 
on Audio and Electroacoustics, Volume AU-21, pp506-526, December 1973, or 
Andrew G. Deczky "Synthesis of Recursive Digital Filters Using the Minimum p-Error 
Criterion" IEEE Transactions on Audio and Electroacoustics, Volume AU-20, pp257- 
263, October 1972, which are both hereby incorporated by reference. Since a filter will 
respond over the entire range, it is not necessary to specify the curves outside of the 
range shown in Figure 7. It is sufficient to just extend the curves to zero frequency and 
the Nyquist rate by simply duplicating the values at the end points shown in Figure 7. 
That is, the response of the filter at wavelengths greater than 6d can have the same 
response at a wavelength of 6d, and wavelengths shorter than l.5d can have the same 
response as at a wavelength of \.5d. These values are somewhat arbitrary but are 
sufficient to produce a working design. 

[0040] Note that window functions are symmetric. This means that for 
an array of n microphone, only (n-l)/2 windowing filters need be implemented. 
Microphones on each side of the center microphone may be summed before filtering, 
thus eliminating the need for a number of filters, although the steering delays will differ 
for the two sides. 

[0041] Figure 9 is a block diagram of processing for overlapped 
microphone arrays in an exemplary embodiment with two spacings, each having five 
microphones. Each microphone goes to a filter that implements the frequency- 
dependent window and the "steering" delay, if these are included. For example, 
microphone 901, which corresponds to a spacing 2d, goes into windowing filter 915. 
Microphone 902, which corresponds to a spacing of both d and 2d, goes to two 
windowing filters, being connected to adder 930 for the spacing d through the filter 930 
and being connected to adder 93 1 for the spacing 2d through the filter 916. 

[0042] Each windowed array is then filtered so that the arrays overlap 
properly to produce an overall flat response when combined by adder 960. Here, the 
array with the spacing d is filtered through overlap filter 950 after the windowed 



9 



responses are combined in adder 930, with filter 951 and adder 93 1 serving the function 
for the array with spacing 2d. One windowing filter is shown for each microphone for 
clarity. Since the window functions are symmetric, pairs of microphones equidistant 
from the center microphone, for example 901 and 907, could be summed (after 
receiving the appropriate steering delay), then filtered by a single frequency-dependent 
window filter so that, in the case of 901 and 907, filters 915 and 919 would then be the 
same filter. If it is desired to simultaneously receive signals from different directions 
(that is, with the array "steered" to different angles), then separate processing would 
have to be supplied for each desired angle. Of course, the direct microphone feeds 
could be stored and processed to extract signals at different angles at a later time. 

[0043] As noted above, each array covers about two octaves. This can 
be separated into the main region, from about 2.\2d to about A.lAd, and the overlap 
regions, which constitute the remainder of the full two octave range. At the extremes 
of the frequency range, there is no overlap, so the highest array will cover up to \.5d r 
and the lowest array will cover down to 6 d x , where dj represents the microphone 
spacing of array j. Using 24 kHz as the highest frequency for which coverage is 
desired and using the spacings d, 2£?,...,2 (N ~ 1) <i, this results in setting the spacing of the 
microphones in the highest frequency array as about 1 cm. From this, the results of 
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Table 1 can be derived: 
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High 


Spacing 


Frequency 


Frequency 




8000 H7 


22067 Hz 


2 cm 


4000 Hz 


8000 Hz 


4 cm 


2000 Hz 


4000 Hz 


8 cm 


1000 Hz 


2000 Hz 


16 cm 


500 Hz 


1000 Hz 


32 cm 


250 Hz 


500 Hz 


64 cm 


125 Hz 


250 Hz 


1.28 m 


62.5 Hz 


125 Hz 


2.56 m 


22.11 Hz 


62.5 Hz 



Table 1 



More generally, if the minimum spacing is taken to be centered at a frequency of, say, 
3-20 kHz, this corresponds to a d in the range of about 10cm> d>0.5cm. 

[0044] The frequencies of Table 1 are not exact, but have been rounded 
to convenient boundaries for clarity. Note again that the highest frequency array 
extends from 1.5<f to A.lAd, and the lowest frequency band extends from 2.\2d to 6d. 
All the others extend from 2.12<i to AAAd. This shows that the entire frequency range 
may be captured by 9 collinear arrays, each having twice the spacing of the next. If 
desired, the larger arrays at lower frequencies may be eliminated. The only effect of 
this is that the pickup will not be highly directional at low frequencies due to the 
widening of the principal lobe of the array response. 

[0045] Note again that steering the array away from angle zero (straight 
ahead) does have the effect of widening the principal lobes, since it lowers the effective 
distance between the microphones. This table was computed at angle zero. Alternately 
the table can be based on a different angle. To be as consistent as possible, it may be 
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preferable to compute a different set of frequency-dependent window functions for 
each desired pickup angle so that the principal lobe width would be constant over the 
entire steering range of the array, which is from -45° to 45°. For many applications, 
however, it is acceptable to allow the width of the principal lobe to change, as long as 
other properties of the array are preserved, such as overall frequency response flatness, 
and matching of the principal lobes among the arrays to prevent coloration of the sound 
in the principal lobe. 

[0046] In addition to the filtering described above to apply the 
frequency-dependent window function to each microphone in each array, there is a 
filter that is applied to the total response from a given array so that each array 
contributes to the overall response mainly in its principal frequency region. It is 
preferable that the sum of the responses across all the arrays be flat over the audible 
range. This can be expressed by considering the impulse response of each array, then 
stating conditions on these responses which represent the design goals. For 
convenience the impulse response of each array can be taken as symmetric. This is not 
strictly necessary, but it guarantees that there will be no phase variance from one array 
to the next. If the impulse response of filter i at a time point s is represented by h is , 
the conditions for flatness of overall frequency response can be stated as follows: 
^, \l s = 0 

(5) ,*<> 

This is necessary and sufficient to guarantee perfectly flat frequency response. In 
general, this condition will not be met exactly. All that is required is that the deviation 
from identity be sufficiently small so it is not heard as an excessive coloration of the 
sound. 

[0047] To compute the overlap filters, the process can start by first 
creating an "ideal" prototype filter that is constructed so that it overlaps perfectly, 
followed by computing approximations to the prototype filter using standard 
approximation techniques (see, for example, J.H. McClellan, T.W. Parks, L.R. Rabiner 
"A Computer Program for Designing Optimum FIR Linear Phase Digital Filters" 
incorporated by reference above). Although a separate prototype filter is preferably 
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created for each band, there are some similarities that make the process simpler. The 
process can separate the filters into the two at the extremes of frequency, and all the 
rest. For the filters that are not at the extremes, it can be required that they are 
identical, except that each band spans twice the frequency of the previous band. For 
example, if a particular frequency band goes from / to 2f, then a filter can be defined as 
follows: 

(6) / e =(4/3)/ 
(?) /,=(2/3)/ 
(8) / 2 -(8/3)/ 

0 3<f, 

(P-fiWc-fl) f^$<fc 
(fi-WVl-fc) fc*#<fl 

0 f 2 <9 
[0048] Figure 10 shows a plot of this function for the frequency band 
2000-4000 Hz. As noted, the filter extends down to 1333 Hz and up to 5333 Hz for 
proper overlap. It will perfectly overlap the filters in the next higher and next lower 
frequency bands, and the sum of these overlapping filters is exactly one by 
construction. The filter for the next higher or lower frequency band may be obtained 
simply by relabeling the frequency axis with either twice the frequencies or half the 
frequencies. Of course, this filter design is not unique. There are many suitable 
choices for the overlap filter that have this property. 

[0049] At the extremes of frequency, the filter can simply be taken to 
stay at unity gain on one side or the other. Using the definitions above, the filters for 
the extremes can be defined as follows: 

(10) H{9) = \{f 2 -9)l{f 2 -f c ) f c <&</ 2 

[ 0 f 2 <& 
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[ 0 3<f x 

(ii) H{&) = lo-AWc-A) f^&<f c 

I 1 fc*# 

[0050] The above description is somewhat careless with the notation, in 
that the above formulas all use the same symbols for the important frequencies ( f x , f 2 , 
and f c ), but this is intended them to apply just to the particular band of interest. As 
noted above, for the band from 2000 to 4000 Hz, /, would be 1333 Hz, and f 2 would 
be 5333 Hz. For other bands, these frequencies would be scaled appropriately to 
represent the frequency range of the particular band. As an example, in the lowest band 
as shown in the table above, f c would be 41.667 Hz, and f 2 would be 83.333 Hz. 
Equation (10) represents the lowest filter, which extends down to zero frequency. 

[0051] Having defined a suitable set of prototype filters for overlapping 
the microphone arrays, filter coefficients that approximate these filters to any degree of 
accuracy may be computed. If the filters are all of zero-phase, then they will sum to an 
approximation of an impulse,, described by Equation (5). This is by construction. 
Since the sum of all the prototype filters is unity, the resulting impulse response must 
be a simple impulse. Consequently, the sum of a series of filters that approximate the 
prototype filters will naturally be an approximation to an impulse. Of course, if the 
filters are not of zero-phase or linear-phase design, they will not necessarily sum to an 
impulse. 

[0052] It should be noted that as the array is steered so that the principal 
lobe is at a non-zero angle, the effective shortening of the microphone spacing by the 
factor of cos(6>) indicates that all the filters, both the windowing filters and the 
overlapping filters, should be recomputed using a microphone spacing of dcos(0). 
Additionally, the beta parameter of the Kaiser-Bessel window (or whatever window 
function is used) may be adjusted so that the width of the principal lobes remains 
constant over the usable steering range of -45° to 45°. 

[0053] There has been an implicit decision in the above to implement 
the frequency-dependent window function and the overlapping filter using FIR, or 
finite impulse-response filters. This is not strictly necessary, but it allows the use of 
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perfectly linear-phase filters. A linear-phase filter has an inherent delay in the signal 
path. If all the filters have the same number of multiplies, then they will all exhibit the 
same delay, and they may be summed. If the filters do not have the same number of 
multiplies, then the delays should be equalized before summing the results of the 
windowing filters. These delays can be offset by combining them with the delays 
necessary for "steering" the array (Equation (3)). If some microphones end up with 
negative delays, then all the microphones must be delayed to assure causality. 

[0054] So far, the directional characteristics of the individual 
microphones in the array have not been discussed. This discussion is perfectly accurate 
if the microphones are omni-directional. Some modifications to the exposition can be 
made to show the effect of directional microphones, such as the pressure-gradient type. 
Figure 11 shows a schematic representation of a pressure-gradient condenser 
microphone 1100. Typically, the neutral interior capsule 1107 is held at ground, and 
the variations of capacitance between the anterior and posterior diaphragms, 
respectively 1103 and 1105, and the capsule 1107 generate a voltage. To obtain 
directional characteristics, the voltages of the anterior and posterior diaphragms may be 
weighted and subtracted. This produces the familiar directional patterns, such as 
cardioid, hypercardioid, and so on. 

[0055] This kind of microphone has the following angular response: 
(12) C + (1-Qcos(0) 

The response straight ahead (zero angle) is exactly one. The response to the rear is 
(2C-1). For a cardioid pattern, C is set to one-half, so the response to the rear is exactly 
zero. Other values of C produce different patterns. 

[0056] The effect of using a pressure-gradient microphone in this array 
is that the off-angle response will be multiplied by the directional pattern described by 
Equation (12). The effect would be that, for instance, the plot shown in Figure 3 would 
also show an amplitude difference as the principal lobe was steered from left to right. 
All the curves in Figure 3 would be multiplied by Equation (12). Note that the peak 
amplitude of the principal lobes in Figure 3 can be normalized by simply correcting for 
the expected attenuation due to the directional characteristics of the microphones. 
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[0057] As noted in the work of Gerzon cited in the Background section, 
it is also possible to take the voltages from the anterior and posterior diaphragms 
separately, thus producing two separate feeds from each microphone. These can then 
be combined later to produce directional characteristics. For instance, one might 
weight the anterior diaphragm by one-half and the posterior diaphragm by minus one- 
half and sum them to produce a forward-facing cardioid pickup, with 100% rejection of 
sounds coming from directly behind. Alternately, one might weight the posterior 
diaphragm with one-half and the anterior diaphragm with minus one-half to produce a 
rear- facing cardioid pickup with 100% rejection of sounds coming from directly in 
front. I n this manner, a single array of pressure-gradient microphones can be used to 
mix the feeds of the diaphragms differently so that the same microphone array may be 
used for sounds in front of the array and behind the array with equal angular resolution 
and identical fidelity (frequency-response). Of course, filtering similar to that shown in 
Figure 9 would be duplicated for the rear-facing array. 

[0058] With phased-array radar, there is always the explicit assumption 
that the incoming wave is a plane wave. With the phased-array microphone, the plane 
wave assumption may be used when the sound sources are sufficiently distant from the 
microphone itself. If this is not the case, the wavefront will be curved. This curvature 
may corrected if the location of the sound source is known. If the plane-wave 
approximation can be made, the distance between the sound source and the array is not 
needed. 

[0059] To correct for the curvature of the wavefront, a correction is 
applied to the amplitude and to the arrival time. The amplitude correction is needed to 
offset the l/r 2 attenuation the wavefront experiences. The correction to the arrival time 
is necessary since the curvature will have the effect of delaying the off-center parts of 
the wavefront. This can be quantized as follows: Let 0 and r 0 be the angle and 
distance from the sound source to the center microphone of the array. The amplitude 
and time delay compensation is then: 

(13) P„ = T y(.2 =cos 2 # + fsin6>-H— J 
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where r n represents the distance from the sound source to microphone n. The feed from 
microphone n should be multiplied by P n and should be advanced by A w seconds. 

[0060] Since this correction is specific to the particular location of the 
sound source, it may be expected that the rejection of the off-axis sound would be 
affected and there may be more "leakage" from off-axis sounds when this kind of 
correction is applied. 

[0061] Note that when the sound source consists of a number of discrete 
sources at known angles and possibly known distances, then the response in a particular 
direction can be enhanced by subtracting off the signals from the known directions. Of 
course, the delays across the varying angles must be equalized before a signal from one 
angle can be subtracted from a signal from another angle. This can be though of as a 
kind of analog to the lateral inhibition found in optical receptors in the retina of the eye. 

[0062] So far in this exposition has operated under the implicit 
assumption that the microphones were identical. In practice this is, of course, not a 
valid assumption and there will be some mismatch. The effect of the mismatch can be 
examined to see what this requires of the microphones. 

[0063] A worst-case bound on the error in the array can be obtained by 
taking the second term of Equation (2), applying a window function, assuming that the 
cosine term is always unity, and assuming that the microphone error is a uniform factor 
of s. This gives the following upper bound: 

(15) M=elw 0 +2 gw,| 

The window function is normalized so that the above sum (across all the points of the 
window function) is unity, so the error is bounded by the individual microphone error. 
The parameter s can be taken to represent the expected value of the error. Some 
microphones will exhibit somewhat more error and some will exhibit somewhat less. 
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[0064] A mean deviation of 1 dB then will produce error in the resulting 
pickup pattern that is about 18 dB down. The error discussed here is a distortion of the 
pickup pattern itself, as shown in Figures 2, 3, and 4. This is not so important for the 
principal lobe, but it can make a significant difference in the sideband suppression, 
since in some cases, the error will be of the same order of magnitude as the sideband 
amplitude itself. It can be expected that the actual sideband rejection will be several dB 
less than the theoretical values with a ldB variation among the microphones. Of 
course, better matching will allow more sideband rejection. 

[0065] So far the discussion has only considered sounds coming from 
point sources that are in front of (or behind) the array. There may also be room 
reverberation, which can come from any direction. The room reverberation may 
(somewhat artificially) be divided into three epochs: the direct sound, the early 
reflections, and everything else. The direct sound and the early reflections can all be 
treated as point sources of sound. The array can be steered to pick up each one of these 
sources separately (or not, depending on the goals of the recording). The late 
reverberation can be considered to be omnidirectional, and will thus affect the array 
uniformly regardless of the steering direction. Of course, non-uniform reflections, such 
as slap echoes, will appear as specular reflections and thus will appear as point sources 
to the array. 

[0066] The discussion may also be extended to more general 
arrangements. To extend the phased-array microphone to three dimensions, it must 
first extended to two dimensions. This can be done by extending the array as shown in 
Figure 12. This shows a regular 2-dimensional array 1200 of microphones that is 
capable of steering plus or minus 45° in the horizontal direction and plus or minus 45° 
in the vertical direction. Note that for some applications, it may not be necessary to 
have the same resolution in the vertical direction as in the horizontal direction. Figure 
13 shows an array 1300 with higher resolution in the horizontal direction than in the 
vertical direction. Additionally, a more general arrangement need not use orthogonal 
axes to determine the spacing of the array. In this last case, the non-orthogonality can 
be compensated for in the signal processing. 
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[0067] A single 2-dimensional array can only be steered across about a 
90° range in the forward direction and a 90° range in the reverse direction. To allow 
steering through the full 360° range, multiple non-coplanar 2- dimensional arrays may 
be used. The simpler case 1400 of two arrays at right angles is shown in Figure 14. 
Note that for this to work best, each array would preferably be acoustically 
"transparent", so that off-axis sounds will easily pass through it to reach the other array. 

[0068] To extend the array to three dimensions, two 2-dimensional 
arrays shown in Figure 14 can be taken and another array in the horizontal plane placed 
to cover the vertical direction. In this manner, pickup in any direction can be achieved. 

[0069] There is a wide range of ways to implement the array, depending 
on the goals of the implementation. One embodiment of the array would be to simply 
connect wires to each transducer in the array and run all the wires to the required 
processing hardware, with preprocessing for each transducer in the form of a 
microphone preamplifier and an A/D converter. Figure 15 shows the processing for 
each microphone in the array in such an embodiment. In the direct implementation of 
Figure 15, the array has a wire from each microphone 1501 in the array to the required 
preprocessing, including microphone preamplifier 1503 and A/D converter 1505. The 
output, along with that from other microphones in the array, then goes on to subsequent 
processing as shown in Figure 9. 

[0070] Of course, different technology can affect the elements in the 
figures. For instance, the use of electret or other microphone technology may render 
the pre-amplifier unnecessary. Similarly, it is possible to combine the microphone 
preamplifier (if any) with the first stage of the A/D converter. In any case, the result of 
the preprocessing is a sequence of digital audio samples. Since a large array may 
contain hundreds of microphones, running individual wires from each microphone to 
the required pre-processing and subsequent processing may be undesirable. 

[0071] With modern technology, high-levels of integration are possible. 
Both analog and digital circuitry can be put into the same package, if not the same 
substrate. See, for instance, US patent 5,051,799 of Paul et al, issued September 24 
1991, which is hereby incorporated by reference. It is possible to produce a very 
compact realization of the preamplifier and D/A converter. It is even possible to 
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combine the microphone preamplifier with the first stage of the D/A converter for even 
a more compact realization. Such circuitry can be on the order of the same size as the 
microphone capsule or even smaller. Figure 16 shows the idea of including the 
preprocessing and A/D conversion in the same physical location as the microphone 
capsule itself. 

[0072] In Figure 16, the microphone capsule integrates the microphone 
1601 with the pre-processing as the integrated pre-processor 1600. In this 
configuration, miniaturized preamplifier 1603 and A/D stages 1605 are integrated with 
some kind of multiplexing (network) interface that combines the signal with those of 
the other microphones. In addition, some kind of data multiplexing circuit is included 
with each microphone so that the outputs of multiple microphones may be combined 
into a single wire. A wide range of multiplexing technology may be used, ranging from 
simple time-domain or frequency-domain multiplexing (see, for example, US patent 
4,922,536 of Hoque, issued May 1 1990, which is hereby included by reference) to 
computer-type network technology, such as Ethernet (see, for example, Metcalfe, R.M., 
and Boggs, D.R. "Ethernet: Distributed Packet Switching for Local Computer 
Networks", Communications of the ACM, Volume 19, Number 7, pp395-404, July 1976 
which is hereby included by reference). The end result of this multiplexing is that the 
data from the entire array is available in a small number of cables, or even a single 
cable, in a manner such that the samples from each individual microphone may be 
separated for the required spatial processing as shown in Figure 9. 

[0073] Figure 17 shows the extension of this sort of embodiment to the 
microphone array "fabric". In this embodiment, power is fed to each 
transducer/processor/multiplexor node via alternating vertical positive and negative 
supply wires. 

[0074] Each oval, such as 1701, represents a complete transducer, 
preprocessor, and network interface as shown in Figure 16. This figure shows how the 
array may be powered by a vertical array of alternating positive, such as 1711, and 
negative supplies, such as 1713. One rail (e.g. the positive wires like as 171 1) may also 
serve as the medium for the network (or additional wires may be used for the network 
interface) by AC-coupling the data back onto the wire. Similarly, clock distribution to 
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the individual A/D converters may be accomplished by placing the clock itself on one 
of the supply wires. By use of frequency-domain multiplexing, the data can be placed 
on the wire in frequency bands that are well above the clock frequency. 

[0075] Note that the entire array could just as easily be wireless (except 
for the supply rails). Each node could simply broadcast a low-power RF signal that 
could be received and demultiplexed for further processing. Each node would have 
some unique ID in the form of a network address, a dedicated frequency, a dedicated 
time slot, or any other way of identifying the node so that the samples may be 
recovered and related back to the original array position of the node. 

[0076] Any medium of transmission could be used to convey the data 
from the array to the processing elements. For instance, each node could emit digital 
data as light on wavelengths that people can not see. The data could be multiplexed 
either by the wavelength of the individual lights, or by time so that only one node 
transmitted data at a time. 

[0077] Hybrid schemes are also possible. That is, "clusters" of some 
number of nodes in a particular area could be multiplexed together with, say, fiber- 
optic cables used to relay the data from each cluster back to the spatial processing 
equipment. 

[0078] Although the various aspects of the present invention have been 
described with respect to specific exemplary embodiments, it will be understood that 
the invention is entitled to protection within the full scope of the appended claims. 
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